Shaping Individualized Impedance Landscapes for Gait Training via Reinforcement Learning

نویسندگان

چکیده

Assist-as-needed (AAN) control aims at promoting therapeutic outcomes in robot-assisted rehabilitation by encouraging patients' active participation. Impedance is used most AAN controllers to create a compliant force field around target motion ensure tracking accuracy while allowing moderate kinematic errors. However, since the parameters governing shape of are often tuned manually or adapted online based on simplistic assumptions about subjects' learning abilities, effectiveness conventional may be limited. In this work, we propose novel adaptive controller that capable autonomously reshaping phase-dependent manner according each individual's motor abilities and task requirements. The proposed consists modified Policy Improvement with Path Integral algorithm, model-free, sampling-based reinforcement method learns subject-specific impedance landscape real-time, hierarchical policy parameter evaluation structure embeds paradigm specifying performance-driven goals. adaptability strategy responses its ability promote short-term adaptations experimentally validated through treadmill training sessions able-bodied subjects who learned altered gait patterns assistance powered ankle-foot orthosis.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A reinforcement learning approach to gait training improves retention

Many gait training programs are based on supervised learning principles: an individual is guided towards a desired gait pattern with directional error feedback. While this results in rapid adaptation, improvements quickly disappear. This study tested the hypothesis that a reinforcement learning approach improves retention and transfer of a new gait pattern. The results of a pilot study and larg...

متن کامل

Learning from Demonstration for Shaping through Inverse Reinforcement Learning

Model-free episodic reinforcement learning problems define the environment reward with functions that often provide only sparse information throughout the task. Consequently, agents are not given enough feedback about the fitness of their actions until the task ends with success or failure. Previous work addresses this problem with reward shaping. In this paper we introduce a novel approach to ...

متن کامل

Reward Shaping in Episodic Reinforcement Learning

Recent advancements in reinforcement learning confirm that reinforcement learning techniques can solve large scale problems leading to high quality autonomous decision making. It is a matter of time until we will see large scale applications of reinforcement learning in various sectors, such as healthcare and cyber-security, among others. However, reinforcement learning can be time-consuming be...

متن کامل

Reinforcement Learning from Demonstration through Shaping

Reinforcement learning describes how a learning agent can achieve optimal behaviour based on interactions with its environment and reward feedback. A limiting factor in reinforcement learning as employed in artificial intelligence is the need for an often prohibitively large number of environment samples before the agent reaches a desirable level of performance. Learning from demonstration is a...

متن کامل

Off-Policy Shaping Ensembles in Reinforcement Learning

Recent advances of gradient temporal-difference methods allow to learn off-policy multiple value functions in parallel without sacrificing convergence guarantees or computational efficiency. This opens up new possibilities for sound ensemble techniques in reinforcement learning. In this work we propose learning an ensemble of policies related through potential-based shaping rewards. The ensembl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE transactions on medical robotics and bionics

سال: 2022

ISSN: ['2576-3202']

DOI: https://doi.org/10.1109/tmrb.2021.3137971